DISTRIBUTION  STATEMENT  A:  Distribution  approved  for  public  release;  distribution  is  unlimited. 


Bayesian  Hierarchical  Model  Characterization  of  Model  Error  in  Ocean  Data 

Assimilation  and  Forecasts 

Ralph  F.  Milliff 

Colorado  Research  Associates  Division,  NWRA 
3380  Mitchell  Lane 
Boulder,  CO  80301 

phone:  (303)  415-9701  fax:  (303)  415-9702  email:  milliff@cora.nwra.com 

Christopher  K.  Wikle 

Department  of  Statistics,  University  of  Missouri 
146  Middlebush 
Columbia,  MO  6521 1 

phone:  (573)  882-9659  fax:  (573)  884-5524  email:  wikle@stat.missouri.edu 

L.  Mark  Berliner  and  Radu  Herbei 
Department  of  Statistics,  The  Ohio  State  University 
1958  Neil  Ave. 

Columbus,  OH  43210 

phone:  (614)  292-0291  fax:  (614)  292-2096  email:  mb@stat.osu.edu 


Award  Number:  N00014-10-C-0354 


LONG-TERM  GOALS 

We  seek  to  focus  quantitative  uncertainty  management  attributes  of  the  Bayesian  Hierarchical  Model 
(BHM)  methodology  on  the  identification,  characterization,  and  evolution  of  irreducible  model  error  in 
ocean  data  assimilation  and  forecast  systems. 

OBJECTIVES 

Our  project  objectives  are  designed  to  build  upon  experience  gained  under  prior  Office  of  Naval 
Research  (ONR)  support.  This  annual  report  describes  progress  attained  in  projects  led  by  PI  Milliff  in 
the  first  full  year  of  funding.  First  year  results  were  also  presented  at  a  project  workshop  held  at  the 
Courant  Institute  for  Mathematical  Sciences,  New  York  University,  in  November  2011.  Objectives 
addressed  in  this  annual  report  focus  on  extensions  of  a  time-  and  space-dependent  vertical  error 
covariance  BHM  from  the  Mediterranean  Forecast  System  (MFS)  to  the  Regional  Ocean  Model  System 
(ROMS)  applications  in  the  California  Current  System  (CCS). 
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APPROACH 


Time-Varying  Error  Covariance  Models:  Review 

During  the  data  assimilation  step  in  an  ocean  forecast  system,  the  forward  model  trajectory  is  adjusted 
by  available  observations.  The  degree  of  adjustment  is  a  function  of  uncertainties  in  the  ocean  state 
vector  estimates  from  the  model  and  observational  uncertainties.  These  uncertainties  are  quantified  in 
Error  Covariance  matrices  in  the  cost  function  minimization  used  to  obtain  an  “optimal”  adjustment  of 
the  model  trajectory  given  the  data.  A  typical  cost  function  is  given  by: 

/=  isxrB_1Sx+i[H(Sx)-d]rR_1[H(5x)-d];  (1) 

where  5x  =  x  —  x/,  for  state  vector  x  and  background  state  vector  xh,  d  =  H(x)  —  y  are  misfits  for 
observation  operator  H  and  observations  y.  The  matrices  B  and  R  are  the  background  and  observation 
error  covariance  matrices,  respectively.  The  full  background  error  covariance  B  =  WT  for  a  sequence 
of  operators  V  =  VdVmvV/!V//Vv  for  diffusion  (D),  nonlinear  advection  (uv),  surface  height  (/?), 
horizontal  ( H )  and  vertical  (V)  variability. 

The  ultimate  object  of  a  time-varying  background  error  covariance  BHM  is  a  space-time  model  for  B. 
For  purposes  of  our  initial  developments  we  focus  on  only  the  vertical  variability.  So  for  now  let, 

B  =  Vy  Vy,  and  <5(x)  =  [<5T(z),  tjS(z)]7',  for  temperature  and  salinity  profiles,  T (z)  and  S(z). 

As  noted  in  Berliner  et  al.  (2003),  and  further  codified  in  Cressie  and  Wikle  (201 1),  BHM  construction 
can  be  partitioned  into  data  stage,  process  model  and  parameter  model  distributions  for  the  components 
on  the  righthand  side  of  Bayes  Theorem.  Gibbs  sampling  algorithms  are  developed  to  estimate  posterior 
distributions  of  interest  given  these  components.  Estimates  of  the  posterior  distribution  for  processes 
(e.g.  et  below)  and  parameters  (e.g.  B/  below)  are  obtained  and  analyzed. 

Let  t,  be  a  space-time  variable  ocean  forecast  model  error  process.  Relying  on  previous  experience  with 
basis  function  expansions  and  random,  time-dependent  amplitude  coefficients  (e.g.  Wikle  et  al.  (2001); 
Milliff  et  al.  (201 1)),  we  pose  the  process  model: 

e,  =  <F/3r  +  rjf  (2) 

where  <£>  are  a  truncated  set  of  vertical  EOF  bases,  /3,  are  time-dependent  amplitudes,  and 
rjt  ~  Gau(().  6^1)  account  for  additional  uncertainty,  such  as  that  arising  from  the  dimension  reduction 
due  to  truncation  of  the  basis  function  set.  Critically,  we  assume  that  ~  Gau( 0,Bf),  where  B,  is  the 
time-dependent  background  error  covariance  matrix  of  interest  in  the  MFS  context. 

The  data  stage  inputs  to  the  error  covariance  BHM  are  model  misfits  d,  noted  above  and  anomalies  qt, 
where  the  anomalies  are  daily  departures  from  the  model  “year-day”  climatologies;  i.e.  qt  =  xt\t_x  —  x 
for  the  state  vector  climatology  xt  for  a  given  forecast  model. 

The  original  error  covariance  BHM  developments,  supported  by  ONR,  were  implemented  in  the  MFS 
where  Bt  is  the  focus.  Impact  studies  for  the  MFS  implementation  of  B,  are  in  progress  now  and  will  be 
reported  and  published  in  the  coming  year.  Current  ONR  funding  supports  the  extension  of  the  error 
covariance  BHM  to  the  CCS  domain  of  the  ROMS  ocean  forecast  system  implemented  at  UCSC.  Here, 
our  primary  interest  is  in  the  error  process  model  itself;  i.e.  ef  as  defined  in  (2). 
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Figure  1 :  California  Current  System  domain  and  data  coverage  by  sub-region.  Sub-regions  include  coastal 
(CoC),  northern  and  southern  abyssal  plains  (NAP,  SAP).  Data  include  temperature  and  salinity  profiles. 
Subsets  of  the  full  dataset  are  shown  at  right.  The  data  subsets  have  been  used  in  the  development  and  first 
implementations  of  the  Error  Covariance  BHM  in  the  CCS. 


WORK  COMPLETED 

The  CCS  domain  is  partitioned  into  3  sub-regions  as  shown  in  Figure  1 .  The  coastal  domain  (CoC) 
contains  continental  shelf  regions.  Offshore  domains  are  separated  in  the  vicinity  of  the  Mendocino 
escarpment.  They  are  denoted  the  Northern  and  Southern  abyssal  plain  sub-regions  (NAP  and  SAP 
respectively).  Fig  1  also  shows  the  distribution  of  temperature  and  salinity  profiles  for  CCS  by 
sub-regions.  Subsets  of  these  data  (Fig.  1 ;  right)  were  used  in  the  first  developments  and 
implementations  of  the  error  covariance  BHM  in  the  CCS. 

Figure  2  shows  time  vs.  depth  aggregates  of  the  temperature  profile  data  for  the  coastal  (CoC;  top)  and 
N.  abyssal  plain  (NAP;  bottom)  data  subsets.  The  misfit  data,  d  (left  panels),  are  only  available  at 
measured  profile  locations  and  times,  while  the  anomaly  data  q  (right  panels)  are  available  for  every 
year-day  in  the  forecast  period. 

The  first  error  process  model  estimates,  for  subregions  of  the  CCS,  are  described  below. 

Relevant  Meetings  and  Presentations 

(Edwards,  Milliff,  Moore)  Experimental  design  considerations,  on-site  visit  with  collaborators  at  Univ. 
California,  Santa  Cruz;  Februrary,  2011. 

(Herbei,  Milliff,  Wikle)  Error  covariance  test  case  discussions,  Advection-Diffusion  model  priors  for 
Model  Error,  Herbei,  Wikle  visit  to  NWRA/CoRA;  June  2010. 

(Edwards,  Milliff,  Moore)  Model  Error  in  4DVar  discussions,  on-site  visit  with  collaborators  at  Univ. 
California,  Santa  Cruz;  July,  2011. 

(Berliner,  Herbei,  Milliff,  Moore,  Wikle)  Informal  presentations  and  discussions  at  the  annual 


3 


Depth  (m)  Depth  (m) 


Misfit  (d):  Temperature 


Time  (Serial) 
Misfit  (d):  Temperature 


Time  (Serial) 


Climate  Anomaly  (q):  Temperature 


1.5 

1 

0.5 

0 


-0.5 


-1 


-1.5 


2 

1.5 

1 

0.5 

0 


-0.5 


-1 


Figure  2:  Temperature  profile  misfit  ( left  panels)  and  anomaly  ( right  panels)  data  stage  inputs  used  for  the 
development  of  a  time-dependent  error  covariance  BHM  application  in  the  California  Current  System  region. 
The  BHM  data  stage  inputs  are  shown  for  the  coastal  (CoC;  top)  and  N.  abyssal  plain  (NAP;  bottom) 
sub-domains.  Data  stage  inputs  for  salinity  are  not  shown. 
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“All-Hands”  project  meeting  at  NWRA/CoRA,  August,  2011. 

(Milliff)  Session  leader;  Stochastic  forecasting  and  relation  to  Data  Assimilation  at,  Guiding  the 
Extension  of  Navy  Operational  Ocean  Data  Assimilation  and  Prediction,  NRL  SSC  sponsored  meeting 
atUniv.  Maryland;  September,  2011 

(Herbei,  Milliff,  Moore)  Presentations  at  ONR  Model  Error  Project  Meeting,  New  York  Univ.; 
November,  2011. 

RESULTS 

Posterior  mean  time-dependent  error,  for  errors  manifest  in  temperature  and  salinity  (not  shown) 
profiles,  are  computed  for  the  CCS  domain  of  the  ROMS  4dVar  data  assimilation  and  ocean  forecast 
system.  Data  stage  inputs  from  ROMS  4dVar  misfits  and  the  ROMS  climatology  for  the  region  are 
convolved  with  basis  function  process  models  (2)  as  described  above.  Uncertainty  in  the  error  process  is 
also  time  dependent  and  can  be  represented  by  time  series  of  standard  deviations  in  posterior 
distributions  from  the  BHM. 

Figure  3  depicts  depth  vs.  time  displays  of  the  error  process  for  temperature,  er(t)  for  the  coastal  and 
N.  abyssal  plain  sub-regions  of  the  CCS.  Time  series  for  standard  deviation  of  the  error  process  are 
shown  at  right  (Fig.  3)  as  computed  from  the  spread  in  the  posterior  distributions.  The  experimental 
domain  focuses  on  the  upper  400m  in  each  sub-region;  for  a  175  d  period,  treated  in  5 d  epochs  so  that 
data  stage  influences  can  be  maximized. 

IMPACT/APPLICATIONS 

The  research  overlapping  the  ONR  project  to  use  BHM  to  augment  MFS,  with  the  initial  year  of  the 
ONR  model  error  project  demonstrates  practical  methods  to  add  time-  and  space-dependence  to  error 
process  and  error  covariance  representations  in  operational  (MFS)  and  near-operational  (ROMS-4dVar) 
ocean  forecast  systems.  Refining  estimates  of  the  time-dependent  changes  in  forecast  uncertainty  across 
regime  shifts  adds  value  to  ocean  forecast  system  output. 

TRANSITIONS 

The  Bayesian  Confab  meetings  in  Boulder  every  August  are  adding  Irreducible  Model  Error  foci  in  the 
informal  presentations  and  discussions  that  characterize  the  meetings. 

Informal  communications  with  scientists  in  the  Ocean  Modelling  branch  of  the  Naval  Research 
Laboratory,  Bay  St.  Louis,  MI  have  carried  over  from  the  ONR  MFS  project. 

RELATED  PROJECTS 

“Bayesian  Hierarchical  Models  to  Augment  the  Mediterranean  Forecast  System”,  ONR  Physical 
Oceanography  Program,  May  2009  -  May  2011. 

“Estimating  Ecosystem  Model  Uncertainties  in  Pan-Regional  Syntheses  and  Climate  Change  Impacts 
on  Coastal  Domains  of  the  North  Pacific  Ocean”,  NSF  US  Globec  Program,  October  2008  -  October 
2011. 
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Figure  3:  Posterior  distribution  temperature  error  process  ej(t)  ( left  panels ),  and  temperature  error  process 
uncertainty  (right  panels)  in  a  time  vs.  depth  format  for  coastal  (CoC;  top )  and  N.  abyssal  plain  (NAP;  bottom ) 
sub-regions.  Error  process  uncertainty  is  expressed  in  standard  deviation  er(t).  Error  process  and  uncertainty 
profiles  are  shown  for  a  175-day  experiment  spanning  the  period  of  greatest  observational  density  in  the  data 

stage  input  records  (Fig.  2).  Units  are  °C. 
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“Quantifying  the  Amplitude,  Structure  and  Influence  of  Model  Error  during  Ocean  Analysis  and 
Forecast  Cycles”,  ONR  Physical  Oceanography  Program,  A.  Moore  (PI). 
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and  Pinardi  et  al.  (201 1),  benefited  from  the  continuing  ONR  support  for  BHM  developments.  See 
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